Modelling Multi- Trait Scale-free Networks by 
Optimization 



Bojin ZHENG^ 2,3* Hongrun WU, Jun Qm^ \ Wenfei LAN 
and Wenhua DU^ * 

1. College of Computer Science, South-Central University for Nationalities, Wuhan 
430074, China 

2. State Key Laboratory of Networking and Switching Technology, Beijing University 
of Posts and Telecommunications, Beijing 100876, China 

3. School of Software, Tsinghua University, Beijing 100084, China 

4. Management School, South-Central University for Nationalities, Wuhan 430074, 
China 

E-mail: zhengbojin@gmail.com 

Abstract. Recently, one paper in A''aiure(Papadopoulos, 2012) raised an old debate 
on the origin of the scale-free property of complex networks, which focuses on whether 
the scale-free property origins from the optimization or not. Because the real-world 
complex networks often have multiple traits, any explanation on the scale-free property 
of complex networks should be capable of explaining the other traits as well. This 
paper proposed a framework which can model multi-trait scale-free networks based on 
optimization, and used three examples to demonstrate its effectiveness. The results 
suggested that the optimization is a more generalized explanation because it can not 
only explain the origin of the scale-free property, but also the origin of the other traits 
in a uniform way. This paper provides a universal method to get ideal networks for 
the researches such as epidemic spreading and synchronization on complex networks. 
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1. Introduction 

Some complex networks present the scale-free property [3j, that is, their degree 
distribution p{k) satisfies p{k) ~ k''' . This phenomenon have been discovered in 
many complex networks, such as the Internet [H HI EI], the World Wide Web [3] and 
the Scientific Cooperation Network[T3]. Owing to the abroad interests, the researches 
on complex network involves more and more disciplines 

Up to date, hundreds of models have been proposed to explain the origin and the 
mechanisms of the scale-free networks. Among them, the most popular model is the 
preferential attachment with the growth (the BA model) [3]. This model says that all 
scale-free networks are evolving; when new nodes emerge, they will link to the previous 
nodes with a linear preference which is proportional to the degrees of the previous nodes. 

Notice that the real-world networks commonly are multi-trait. That is, one network 
can be not only scale-free, but also small-world[17J or has other traits [9l [16]. Therefore, 
every explanation on the origin of the scale-free networks should be capable of explaining 
the other traits, otherwise, it can not be treated as the ultima explanation. Because 
the preferential attachment with the growth only explains the mechanism of the scale- 
free property, and it can not explain the other traits such as the small-world effect and 
fractal structure [1^, researchers developed many explanations based on the preferential 
attachment. 

Actually, in spite of the popularity of the preferential attachment, researchers have 
proposed other explanations, such as the randomness [TO], the optimization [HI [7], the 
hierarchy [151 [19]. For example, Carlson et al. thought that the preferential attachment 
can not explain the Internet and proposed that the HOT (Highly Optimized Tolerance) 
is the right mechanism[6l [7]. 

Recently, an old debate arose again[2]. On one side, Papadopoulos et al. proposed 
the popularity x similarity optimization model to explain the origin of the preferential 
attachment and the scale-free networks[14j; Zheng et al. proposed a dual optimization 
model to explain the origins of complex networks and to clarify their relationships [20] . 
These results show that the scale-free property and the other traits can origin from the 
optimization. On the other side, the BA model still dominates with strong supports 
from evidences and theories. Therefore, whether the scale-free networks origins from the 
the randomness or the optimization? Furthermore, whether the preferential attachment 
origins from the randomness or the optimization? These questions require the answers. 

In this paper, we demonstrate that the power law distribution can be transferred 
into the optimization problems and then propose an optimization framework. Under 
this framework, we can append additional traits such as the small- world effect, high 
clustering coefficient et al. into the resultant scale-free networks. This paper validates 
the optimization explanation on the origin of the scale-free networks and even most of 
complex networks with compatibility to the BA model . Moreover, this paper provides 
a universal method to the generation of the ideal multi-trait networks for the simulation 
experiments in the other researches, such as the invulnerability, epidemic spreading. 



Modelling Multi-Trait Scale-free Networks 



3 



game theory and synchronization on complex networks. 
2. Methods 

In this paper, we transfer the power law distribution into the optimization objective, 
and then add the other traits as the constraints, therefore, the solutions of the obtained 
optimization problem would be scale-free networks with the specific traits. When the 
optimization problem is determined, we can develop an algorithm to solve it, i.e., obtain 
the desired network. 

2.1. The optimization objective 

To generate a scale-free network, there exist many methods, and these methods often 
are regarded as the mechanisms to explain how the scale-free networks origin. From the 
view of optimization, there also are many methods to generate the scale-free property. 
Here we introduce a simple method, that is, treating the scale-free property as the 
optimization objective. 

When treating the scale-free property as the optimization objective, we actually 
optimize the resultant network to make its degree distribution to satisfy the power law 
distribution. Assume that the desired network is undirected and unweighted with N 
nodes, we actually need to make the degrees of nodes to be the samples of the power 
law distribution. In another words, we can generate N ideal samples (see the appendix 
for the method), and then optimize a network to achieve that its node degrees match 
these samples. That is, the optimization objective can be written as equation [1] 



i=l 

Here, A is the adjacent matrix of the network, di is the degree of the i-th node, and 
Si is the value of the i-th sample. 

2.2. The constraints 

The other traits of the desired networks can be treated as the constraints. For example, 
the clustering coefficient plays an important role in complex networks, if we want the 
network has a specific clustering coefficient C, then the optimization problem can be 
expressed as equation [2l 



TV 




(1) 



min g{A) = J2 {di{A) - Sif 



1=1 



(2) 



s.t. 

cc{A) = C 



Here cc is the clustering coefficient function of the optimizing network. 
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Moreover, the average shortest path length is a key traits for complex networks, 
especially, many complex networks present a characteristic value, about ln{N). We can 
express the the average shortest path length to be a constraint as equation [3l 

N 2 

min g{A) = J2 (,di{A) - Si) 

(3) 

y(A) = ln{N) 

Here y is the average shortest path function of the optimizing network. 

Of course, one optimization problem can have multiple constraints. Taking 
the small-world effect as the example, the small-world networks should include two 
constraints, that the average shortest path length is ln{N) and the clustering coefficient 
is medium. Therefore, it can be depicted as equation HJ 

N 2 

min g{A) = J2 {di{A) - s,) 

i=l 

s.t. (4) 

y{A) = ln{N) 
cc{A) = C 

2.3. The optimization algorithm 

Once the form and parameters of the optimization problem are finally determined, we 
can employ an optimization algorithm to solve this problem. 

According to the Lagrangian relaxation methodjl], equation [2] can be rewritten as 
equation |5l 

N 

min g\A) = ^ - Sif + ^(cc(A) - (5) 

i=l 

Here 9 is an arbitrary positive real number. 
Similarly, equation [3] can be rewritten as equation [61 

N 

mm g\A) = ^ - s,f + Oiy - lniN)f (6) 

When the optimization model has multiple constraints, the Lagrangian relaxation 
method needs multiple parameters. So that equation |4] can be rewritten as equation [71 

mtn g\A) = {d,{A) - s,f + d{y - ln{N)f + v?(cc(A) - Cf (7) 

Because the optimization problems are simple, we employ the classic hill-climbing 
algorithm to solve them. Because the hill-climbing algorithm belongs to the iterative 
algorithm, this paper set the iteration number as 100000. 

The algorithm can be depicted as Fig. [H 
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0. Generating the samples; 

Unitializing a connected network A randomly and let count=0; 

2. Calculating g'{A); 

3. while count < 100000 

4. A' = A and change an edge of A' randomly; 

5. if A' is disconnected then jump to 3; 

6. Calculating g'{A'); 

7. if g'{A') < g'{A) 

8. A = A'; 

9. end if; 

a. count = count + 1; 

b. end while; 

c. Output A. 

Figure 1. The pseudocode of the optimization algorithm 

3. Results 

We solved equation Ej [3] and H] with different parameters by simulation. We select 9 
groups of parameter settings that are listed as Table [H Moreover, all the number of 
nodes are set to 300 such that the topology of the resultant networks can be clearly 
visualized. 



Table 1. The parameter settings 



Group 


7 


kmin 


kmax 


E 


6 






C 


(A) 


2 


1 


27 


347 


1 






0.06 


(B) 


2 


1 


27 


347 


1 






0.1 


(C) 


2 


2 


43 


761 


1 






0.6 


(D) 


2 


1 


27 


347 




1 


5.7 




(E) 


2 


2 


43 


347 




1 


5.7 




(F) 


2.4 


2 


30 


559 




1 


5.7 




(G) 


2 


1 


27 


347 


1 


1 


5.7 


0.1 


(H) 


2 


2 


43 


761 


1 


1 


5.7 


0.6 


(I) 


2.4 


2 


30 


559 


1 


1 


5.7 


0.3 



In Table [H / is the expected average shortest path, E is the number of the edges of 
the resultant network, 7 is the expected exponent of the power law distribution, kmax 
is the maximal value of node degrees. 

In the experiments, we carried out 30 times for every parameter setting. The 
corresponding statistical results are listed as Table El 

In Table [21 7' is the exponent of the power law distribution of the resultant network, 
Avg means the average value and STD means the standard deviation. 
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Table 2. The corresponding results 



Group 


Avg{-f') 


STD{i) 


Avg{y) 


STD{y) 


Avg{cc) 


STD{cc) 


(A) 


2.05692 


0.00751 


- 


- 


0.06000 


3.60E-07 


(B) 


2.08833 


0.00389 


- 


- 


0.10001 


7.78E-07 


(C) 


2.15800 


0.00389 






0.59510 


0.00969 


(D) 


2.04800 


0.00315 


5.69990 


1.29E-05 






(E) 


2.10400 


0.01888 


5.70000 









(F) 


2.46100 


0.00447 


5.70000 









(G) 


2.10500 


0.00548 


5.68160 


0.04635 


0.09535 


0.00603 


(H) 


2.15000 


0.00408 


5.69439 


0.00408 


0.59442 


0.00283 


(I) 


2.53800 


0.00447 


5.70001 


3.13E-05 


0.29822 


0.00371 



From Table [21 we can see that the experimental results are satisfactory. The 
indicators of the traits of the resultant networks approximate to the desired value. 

For each parameter setting, we chose one resultant network shown as Fig. |2] to [TOl 




Figure 2. The topology of the selected network from group (A) 



Comparing Fig. [2] and [31 we can see that when the clustering coefficient increases, 
the edges tend to congregate to the center. 

Comparing Fig. [2] to [11 we can see that when the kmin increases, the clustering 
coefficient increases rapidly. 

Comparing Fig. [5] and [6l we can see that when the kmin increases, the topology 
of the desired networks change greatly. The edges congregate to the center with long 
loops around. 

Comparing Fig. [6] and [3, the center has more dense edges, and the loops are longer. 
Comparing Fig. [8] to [TOl we can see that when the kmin and 7 change, the small- 
world networks have very different topological structures. 
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Figure 3. The topology of the selected network from group (B) 




Figure 5. The topology of the selected network from group (D) 
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Figure 6. The topology of the selected network from group (E) 




Figure 7. The topology of the selected network from group (F) 




Figure 8. The topology of the selected network from group (G) 
4. Discussion 

This paper demonstrated the abihty of the proposed framework to generate the desired 
multi-trait networks. According to the optimization theory, the Lagrangian relaxation 




method can deal with multiple constraints, therefore, this framework can be easily 
extend to obtain the networks with more than 2 traits. For example, we can generate a 
scale- free network with specific average shortest path and clustering coefficient. 

Because the proposed framework based on optimization can generate multi-trait 
networks, we can say that the optimization is an explanation to the origin or the 
mechanism of the scale-free networks with some traits, this framework only focus on the 
final forms of the power law distributions, therefore, no matter what are the origin and 
the mechanisms of the scale-free property, they can be easily integrated in. Furthermore, 
it has been proved that the scale-free property can be obtained by the optimization, so 
scale-free networks with arbitrary traits can be expressed as the optimization problems. 
That is, the optimization is a universal explanation on the origins of scale-free networks. 

As to the BA model, it is compatible of the proposed framework, because it can be 
used only to explain the generation of the scale-free property. 

Besides, this paper employed a classic algorithm to solve the optimization problems. 
If better the-state-of-the-art algorithms are used, the experimental results are expected 
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to be better. 



5. Conclusion 



This paper proposed a general framework to obtain multi-trait scale-free networks. This 
framework firstly transfers the power law distribution into the optimization objective 
and the other traits as the constraints, so obtains an optimization problem, and then 
employs an optimizer to solve this problem, finally obtains the desired multi-trait 
network. Taking three examples, this paper also demonstrated how to apply this 
framework to generate the desired network. From the experimental results, we also 
found that the small-world effect actually means very different topological structures. 

Because the proposed framework can uniformly explain the origin of complex 
networks with multiple traits, this paper provided a perspective on the origin and 
the mechanisms of complex networks. Especially, this paper can easily integrate the 
BA model or the optimization explanations into the proposed framework, i.e., this 
framework is compatible of the BA model and the optimization. 

Moreover, this paper can be used to obtain ideal topology of multi-trait complex 
networks which would be necessary in the researches on the invulnerability of the 
complex networks 1 1 2 [ [T| [18] . the synchronization, the control, the game and the epidemic 
spreading on complex networks [HI [5]. 



6. Appendix 

When samples are generated, they are expected to satisfy the power law distribution 
as possible as they can, especially when N is not large enough. Therefore, this paper 
suggests a method to generate the samples. 

Notice the continuous version of the power law distribution, shown as equation ([H]), 
the power law distribution depends on two parameters, the kmin and 7. Here kmin is 
the minimum value of the node degrees. 

^ kmin ^kmirij ^ ^ 

When kmin and 7 are determined, we can calculate the expected occurrence of 
every k. Although k can be any value between A — 1 and kmin, in most circumstance, 
the probability is quite small when k is large, therefore, this paper suggest a maximum 
value kmax and set the probability of A; as when k is larger than kmax. Hence, the 
power law distribution can be written as equation [91 

Pik) = (9) 

k=kmin 

According to equation [HI we should determine kmax. By calculating the expected 
occurrence for each k, we can get the probability values for the degrees. If the expected 
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occurrence of certain degree is smaller than a threshold 0.3, then we truncate it, so that 
we get the kmax. Then we recalculate p{k) and round the expected of each k and get 
the samples. 
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